Reassessing the Canon: “fixed” phrases in general reference corpora
نویسنده
چکیده
This paper sets forth the argument for revisiting fixed phrases in the light of the knowledge that their fixedness is not necessarily something to be taken for granted. It focuses on the location and analysis of variant forms in general reference corpora. Existing phraseological structures, including collocational frameworks, idiom schemas and semi-prepackaged phrases, are introduced by way of background before a procedure for retrieving non-canonical forms of fixed expressions in general reference corpora is presented. Some implications relating to the study of variant forms are presented, along with suggestions for future research directions.
منابع مشابه
The quantitative conversion of the component composition of steady-phrases
The Article is devoted to one of the methods of the conversion of fixed expressions (proverbs, sayings, aphorisms and such cliché’ sentences) in the modern Russian language – to reduce their component composition (implizieren, implications). Question quantitative changes in the steady phrases are considered in the aspect of the General problem of phraseological variability. T...
متن کاملLearning Translations of Named-Entity Phrases from Parallel Corpora
We develop a new approach to learning phrase translations from parallel corpora, and show that it performs with very high coverage and accuracy in choosing French translations of English named-entity phrases in a test corpus of software manuals. Analysis of a subset of our results suggests that the method should also perform well on more general phrase translation tasks.
متن کاملAdapting language models for frequent fixed phrases by emphasizing n-gram subsets
In support of speech-driven question answering, we propose a method to construct N-gram language models for recognizing spoken questions with high accuracy. Question-answering systems receive queries that often consist of two parts: one conveys the query topic and the other is a fixed phrase used in query sentences. A language model constructed by using a target collection of QA, for example, n...
متن کاملA Continuum-Based Approach for Tightness Analysis of Chinese Semantic Units
Chinese semantic units fall into a continuum of connection tightness, ranging from very tight, non-compositional expressions, tight compositional words, phrases, and then to loose more or less arbitrary combinations of words. We propose an approach to measure tightness connection within this continuum, based on document frequency of segmentation patterns in a reference corpus. A variety of corp...
متن کاملRecognition of non-domain phrases in automatically extracted lists of terms
In the paper, we address the problem of recognition of non-domain phrases in terminology lists obtained with an automatic term extraction tool. We focus on identification of multi-word phrases that are general terms and discourse function expressions. We tested several methods based on domain corpora comparison and a method based on contexts of phrases identified in a large corpus of general la...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006